wo 2005/061705 



PCT/KR2004/003420 



WHAT IS CLAIMED IS: 

1 . A method of regulating expression of a gene in a prokaryotic cell, the method 
comprising: 

providing a prokaryotic cell comprising a nucleic acid encoding an artificial 
polypeptide, wherein the artificial polypeptide comprises a zinc finger domain, and wherein 
the artificial polypeptide binds to a target DNA site in a gene; 

expressing the nucleic acid encoding the artificial polypeptide in the cell 
under conditions in which the artificial polypeptide is produced, binds to the target DNA site, 
and regulates the gene.. 

2. The method of claim 1 , wherein the artificial polypeptide comprises at least 
three zinc finger domains. 

3. The method of claim 1, wherein the gene is an endogenous gene. 

4. The method of claim 3, wherein expression of two or more endogenous genes 
is regulated. 

5 . The method of claim 4, wherein the artificial polypeptide regulates expression 
of a polycistronic RNA. 

6. The method of claim 1 , wherein expression of the gene is repressed relative to 
expression of the gene in the absence of the artificial protein. 

7. The method of claim 1, wherein the cell is an E. coli cell. 

8. The method of claim 1, wherein the regulating alters a trait of the cell relative 
to a reference cell. 



9. 

resistance. 



The method of claim 8, wherein the trait is heat resistance or solvent 
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10. The method of claim 3, wherein the endogenous gene encodes a 
decarboxylase enzyme. 

11. The method of claim 1 0, wherein the decarboxylase enzyme is a 
decarboxylase enzyme of a ubiquinone biosynthetic pathway. 

12. The method of claim 1 1 , wherein the enzyme is a ubiX gene product. 

13. The method of claim 1, wherein expression of the nucleic acid encoding the 
artificial polypeptide is regulatable. 

14. The method of claim 3, furthCT comprising characterizing the endogenous 

gene. 

1 5 . The method of claim 1 4, wherein the characterizing comprises identifying 
DNA bound by the artificial polypeptide, and determining the nucleotide sequence of the 
endogenous gene associated with the bound DNA. 

16. The method of claim 15, wherein the isolating comprises cross-linking the 
artificial protein to the DNA, and immunoprecipitating the artificial protein. 

17. The method of claim 15, Anther comprising identifying a homolog of the 
endogenous gene in a second type of cell, and regulating the expression of the homolog. 

18. The method of claim 1 7, wherein the second type of cell is a prokaryotic cell. 

19. The method of claim 1 8, wherein the second type of cell is a bacterial cell. 

20. A method comprising: 

providing a plurality of prokaryotic cells, wherein each cell of the plurality 
comprises a nucleic acid encoding an artificial polypeptide, wherein the artificial polypeptide 
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comprises a zinc finger domain, and wherein the artificial polypeptide differs among the 
cells of the plurality; 

identifying firom the plurality a cell that has a trait that is altered relative to a 

reference cell. 

21 . The method of claim 20, wherein the trait is tolerance to an organic solvent, 
and wherein the identifying comprises exposing cells of the plurality to the organic solvent 
and evaluating survival of the cells. 

22. The method of claim 20, wherein the trait is heat tolerance, and wherein the 
evaluating comprises exposing the cells to heat. 

23. The method of claim 20, further comprising isolating the nucleic acid 
encoding the artificial polypeptide from the identified cell. 

24. The method of claim 23, further comprising sequencing the nucleic acid. 

25. The method of claim 20, further comprising isolating the artificial 
polypeptide from the identified cell. 

26. The method of claim 20, fiirther comprising isolating the nucleic acid 
encoding the artificial polypeptide from the identified cell, introducing the nucleic acid into a 
second plurality of cells, culturing the cells of flie second plurality under conditions wherein 
the artificial polypeptide is produced, and identifying a cell of the second plurality having a 
trait that is altered relative to a reference cell. 

27. The method of claim 20, further comprising determining the sequence of the 
target DNA site of the artificial polypeptide. 

28. . The method of claim 20, further comprising identifying an endogenous gene 
bound by the artificial polypeptide. 
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29. The method of claim 20, further comprising analyzing the expression of one 
or more genes of the cell. 

30. The method of claim 28, further comprising modifying expression of the 
endogenous gene in a second cell. 

3 1 . The method of claim 20, wherein the artificial polypeptide comprises at least 
three zinc finger domains. 

32. The method of claim 3 1 , wherein the zinc finger domains are yeast zinc finger 
domains, or variants thereof. 

33 . The method of claim 20, further comprising cultivating the identified cell to 
exploit the altered trait. 

34. A prokaryotic cell comprising: 

a nucleic acid encoding an artificial polypeptide, wherein the artificial 
polypeptide comprises a zinc fmger domain, and wherein the artificial polypeptide binds to a 
target DNA site isi a gene and regulates expression of the gene under conditions in which the 
nucleic acid is expressed. 

35. The cell of claim 34, wherein the artificial polypeptide regulates expression of 
an endogenous gene. 

36. The cell of claim 34, wherein the artificial polypeptide comprises at least 
three zinc finger domains. 

37. The cell of claim 35, wherein the gene is a decarboxylase. 

38. A cell selected by the method of claim 20. 
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39. A polypeptide comprising at least one zinc finger domain, wherein the DNA 
contacting residues of the zinc finger domain at positions -1, +2, +3, and +6 correspond to a 
motif selected firom: RSHR, HSSR, ISNR, RDHT, QTHR, VSTR, QNTQ, and CSNR, and 
wherein the polypeptide regulates an endogenous prokaryotic gene. 

40. The polypeptide of claim 39, further comprising a second and third zinc 
finger domain, wherein the DNA contacting residues of the first, second, and third domains 
at positions -1, +2, +3, and +6 of each domain respectively correspond to the motifs RSHR, 
HSSR, and ISNR. 

41 . The polypeptide of claim 39, firrther comprising a second and third zinc 
finger domain, wherein the DNA contacting residues of the first, second, and third domains 
at positions -1, +2, +3, and +6 of each domain respectively correspond to the motifs ISNR, 
RDHT, and QTHR. 

42. The polypeptide of claim 41 , fiirther comprising a fourth zinc finger domain, 
wherein the DNA contacting residues of the fourth domain at positions -1, 4-2, +3, and +6 of 
correspond to the motif VSTR. 

43. The polypeptide of claim 39, fiuther comprising a second and third zinc 
finger domain, wherein the DNA contacting residues of the first, second, and third domains 
at positions -1, +2, +3, and +6 of each domain respectively correspond to the motifs QNTQ, 
CSNR, and ISNR. 

44. A polypeptide comprising at least one zinc finger domain, wherein the DNA 
contacting residues of the zinc finger domain at positions -1, +2, +3, and +6 correspond to a 
motif selected from: QSHV, VSNV, QSNK, RDHT, QTHR, QSSR, WSNR, VSNV, RSHR, 
DS AR, QTHQ, RSHR, QSNR, and CSNR, and wherein the polypeptide regulates an 
endogenous prokaryotic gene. 

45. The polypeptide of claim 44, further comprising a second, third, and fourth 
zinc finger domain, wherein the DNA contacting residues of the first, second, third, and 
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fourth domains at positions -1, +2, +3, and +6 of each domain respectively correspond to the 
motifs QSHV, VSNV, QSNK, and QSNK. 

46. The polypeptide of claim 44, further comprising a second, third, and fourth 
zinc finger domain, wherein the DNA contacting residues of the first, second, third, and 
fourth domains at positions -1, +2, +3, and +6 of each domain respectively correspond to the 
motifs RDHT, QSHV, QTHR, and QSSR. 

47. The polypeptide of claim 44, further comprising a second, third, and fourth 
zinc finger domain, wherein the DNA contacting residues of the first, second, third, and 
fourth domains at positions -1, +2, +3, aad +6 of each domain respectively correspond to the 
motifs WSNR, QSHV, VSNV, and QSHV. 

48. The polypeptide of claim 44, further comprising a second, third, and jfourfh 
zinc finger domain, wherein the DNA contacting residues of the first, second, third, and 
fourth domains at positions -1, -1-2, +3, and H-6 of each domain respectively correspond to the 
motifs QTHR, RSHR, QTHR, and QTHR. 

49. The polypeptide of claim 44, further comprising a second, third, and fourth 
zinc finger domain, wherein the DNA contacting residues of the first, second, third, and 
fourth domains at positions -1, -1-2, +3, and +6 of each domain respectively correspond to the 
motifs DSAR, RDHT, QSHV, and QTHR. 

50. The polypeptide of claim 44, further comprising a second, third, and fourth 
zinc finger domain, wherein the DNA contacting residues of the first, second, third, and 
fourth domains at positions -1, -1-2, +3, and -1-6 of each domain respectively correspond to the 
motifs QTHQ, RSHR, QTHR, and QTHR. 

51. The polypeptide of claim 44, further comprising a second, third, and fourth 
zinc finger domain, wherein the DNA contacting residues of the first, second, third, and 
fourth domains at positions -1, +2, +3, and +6 of each domain respectively correspond to the 
motifs QSHV, VSNV, QSNR, and CSNR. 
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52. The polypeptide of claim 44, further comprising a second, third, and fourth 
zinc finger domain, wherein the DNA contacting residues of the first, second, third, and 
fourth domains at positions -1, +2, +3, and +6 of each domain respectively correspond to the 
motifs VSNV, QTHR, QSSR, and RDHT. 

53. The polypeptide of claim 44, further comprising a second, third, and fourth 
zinc finger domain, wherein the DNA contacting residues of the first, second, third, and 
fourth domains at positions -1, +2, +3, and +6 of each domain respectively correspond to the 
motifs RDHT, QSHV, QTHR, and QSNR. 

54. The polypeptide of claim 44, fiirther comprising a second, third, and fourth 
zinc finger domain, wherein the DNA contacting residues of the first, second, third, and 
fourth domains at positions —1, +2, +3, and +6 of each domain respectively correspond to the 
motifs DSAR, RDHT, QSNK, and QTHR. 

55. A nucleic acid encoding the polypeptide of claim 39. 

56. A bacterial expression vector comprising a nucleic acid encoding the 
polypeptide of claim 39. 
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